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(1) GENERAL INFORMATION: 

(i) APPLICANT: S vends en, Allan 

Bisgard-Frantzen, Henrik 
Borchert, Torben Vedel 

(ii) TITLE OF INVENTION: a- Amylase Mutants 

(iii) NUMBER OF SEQUENCES: 13 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Novo Nordisk of North America, Inc. 

(B) STREET: 405 Lexington Avenue, 64th Floor 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY: United States of America 

(F) ZIP: 10174-6401 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/683,838 

(B) FILING DATE: 18-JUL-1996 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Green, Reza 

(B) REGISTRATION NUMBER: 38,475 

(C) REFERENCE /DOCKET NUMBER : 4394.400-US 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 212-867-0123 

(B) TELEFAX: 212-878-9655 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1920 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) . TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 334.. 1869 

(ix) FEATURE: 

(A) NAME /KEY : sig_peptide 

(B) LOCATION: 334.. 420 

(ix) FEATURE: 

(A) NAME / KEY : mat_peptide 

(B) LOCATION: 421.. 1869 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
CGGAAGATTG GAAGTACAAA AATAAGCAAA AG ATTGT C AA TCATGTCATG AGCCATGCGG 6 0 
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GAGACGGAAA AATCGTCTTA ATGCACGATA TTTATGCAAC GTTCG CAG AT GCTGCTGAAG 12 0 

AGATTATTAA AAAGC TGAAA GCAAAAGGCT ATCAATTGGT AACTGTATCT CAGCTTGAAG 18 0 

AAGTGAAGAA GCAGAGAGGC TATTGAATAA ATGAGTAGAA GCGCCATATC GGCGCTTTTC 24 0 

TTTTGGAAGA AAATATAGGG AAAATGGTAC TTGTTAAAAA TTCGGAATAT TTATACAACA 3 00 

TCATATGTTT CACATTGAAA GGGGAGGAGA ATC ATG AAA CAA CAA AAA CGG CTT 3 54 

Met Lys Gin Gin Lys Arg Leu 
-29 -25 

TAC GCC CGA TTG CTG ACG CTG TTA TTT GCG CTC ATC TTC TTG CTG CCT 4 02 

Tyr Ala Arg Leu Leu Thr Leu Leu Phe Ala Leu lie Phe Leu Leu Pro 
-20 -15 -10 

CAT TCT GCA GCA GCG GCG GCA AAT CTT AAT GGG ACG CTG ATG CAG TAT 4 50 

His Ser Ala Ala Ala Ala Ala Asn Leu Asn Gly Thr Leu Met Gin Tyr 
-5 15 10 

TTT GAA TGG TAC ATG CCC AAT GAC GGC CAA CAT TGG AGG CGT TTG CAA 4 98 

Phe Glu Trp Tyr Met Pro Asn Asp Gly Gin His Trp Arg Arg Leu Gin 
15 20 25 

AAC GAC TCG GCA TAT TTG GCT GAA CAC GGT ATT ACT GCC GTC TGG ATT 546 
Asn Asp Ser Ala Tyr Leu Ala Glu His Gly lie Thr Ala Val Trp lie 
30 35 40 

CCC CCG GCA TAT AAG GGA ACG AGC CAA GCG GAT GTG GGC TAC GGT GCT 5 94 

Pro Pro Ala Tyr Lys Gly Thr Ser Gin Ala Asp Val Gly Tyr Gly Ala 
45 50 55 

TAC GAC CTT TAT GAT TTA GGG GAG TTT CAT CAA AAA GGG ACG GTT CGG 642 
Tyr Asp Leu Tyr Asp Leu Gly Glu Phe His Gin Lys Gly Thr Val Arg 
60 65 70 

ACA AAG TAC GGC ACA AAA GGA GAG CTG CAA TCT GCG ATC AAA AGT CTT 6 90 

Thr Lys Tyr Gly Thr Lys Gly Glu Leu Gin Ser Ala lie Lys Ser Leu 
75 80 85 90 

CAT TCC CGC GAC ATT AAC GTT TAC GGG GAT GTG GTC ATC AAC CAC AAA 738 
His Ser Arg Asp lie Asn Val Tyr Gly Asp Val Val lie Asn His Lys 
95 100 105 

GGC GGC GCT GAT GCG ACC GAA GAT GTA ACC GCG GTT GAA GTC GAT CCC 786 
Gly Gly Ala Asp Ala Thr Glu Asp Val Thr Ala Val Glu Val Asp Pro 
110 115 120 

GCT GAC CGC AAC CGC GTA ATT TCA GGA GAA CAC CTA ATT AAA GCC TGG 8 34 

Ala Asp Arg Asn Arg Val lie Ser Gly Glu His Leu lie Lys Ala Trp 
125 130 135 

ACA CAT TTT CAT TTT CCG GGG CGC GGC AGC ACA TAC AGC GAT TTT AAA 8 82 

Thr His Phe His Phe Pro Gly Arg Gly Ser Thr Tyr Ser Asp Phe Lys 
140 145 150 

TGG CAT TGG TAC CAT TTT GAC GGA ACC GAT TGG GAC GAG TCC CGA AAG 93 0 

Trp His Trp Tyr His Phe Asp Gly Thr Asp Trp Asp Glu Ser Arg Lys 
155 160 165 170 

CTG AAC CGC ATC TAT AAG TTT CAA GGA AAG GCT TGG GAT TGG GAA GTT 978 
Leu Asn Arg lie Tyr Lys Phe Gin Gly Lys Ala Trp Asp Trp Glu Val 
175 180 185 

TCC AAT GAA AAC GGC AAC TAT GAT TAT TTG ATG TAT GCC GAC ATC GAT 102 6 

Ser Asn Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp lie Asp 
190 195 200 
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TAT GAC CAT CCT GAT GTC GCA GCA GAA ATT AAG AGA TGG GGC ACT TGG 10 74 

Tyr Asp His Pro Asp Val Ala Ala Glu lie Lys Arg Trp Gly Thr Trp 
ft 205 210 215 

TAT GCC AAT GAA CTG CAA TTG GAC GGT TTC CGT CTT GAT GCT GTC AAA 112 2 

Tyr Ala Asn Glu Leu Gin Leu Asp Gly Phe Arg Leu Asp Ala Val Lys 
220 225 230 

CAC ATT AAA TTT TCT TTT TTG CGG GAT TGG GTT AAT CAT GTC AGG GAA 117 0 

His lie Lys Phe Ser Phe Leu Arg Asp Trp Val Asn His Val Arg Glu 
235 240 245 250 

AAA ACG GGG AAG GAA ATG TTT ACG GTA GCT GAA TAT TGG CAG AAT GAC 1218 
Lys Thr Gly Lys Glu Met Phe Thr Val Ala Glu Tyr Trp Gin Asn Asp 
255 260 265 

TTG GGC GCG CTG GAA AAC TAT TTG AAC AAA ACA AAT TTT AAT CAT TCA 12 6 6 

Leu Gly Ala Leu Glu Asn Tyr Leu Asn Lys Thr Asn Phe Asn His Ser 
270 275 280 

GTG TTT GAC GTG CCG CTT CAT TAT CAG TTC CAT GCT GCA TCG ACA CAG 1314 
Val Phe Asp Val Pro Leu His Tyr Gin Phe His Ala Ala Ser Thr Gin 
285 290 295 

GGA GGC GGC TAT GAT ATG AGG AAA TTG CTG AAC GGT ACG GTC GTT TCC 13 6 2 

Gly Gly Gly Tyr Asp Met Arg Lys Leu Leu Asn Gly Thr Val Val Ser 
300 305 310 

AAG CAT CCG TTG AAA TCG GTT ACA TTT GTC GAT AAC CAT GAT ACA CAG 1410 
Lys His Pro Leu Lys Ser Val Thr Phe Val Asp Asn His Asp Thr Gin 
315 320 325 330 

CCG GGG CAA TCG CTT GAG TCG ACT GTC CAA ACA TGG TTT AAG CCG CTT 14 5 8 

Pro Gly Gin Ser Leu Glu Ser Thr Val Gin Thr Trp Phe Lys Pro Leu 
335 340 345 

GCT TAC GCT TTT ATT CTC ACA AGG GAA TCT GGA TAC CCT CAG GTT TTC 15 06 

Ala Tyr Ala Phe lie Leu Thr Arg Glu Ser Gly Tyr Pro Gin Val Phe 
350 355 360 

TAC GGG GAT ATG TAC GGG ACG AAA GGA GAC TCC CAG CGC GAA ATT CCT 15 54 

Tyr Gly Asp Met Tyr Gly Thr Lys Gly Asp Ser Gin Arg Glu lie Pro 
365 370 375 

GCC TTG AAA CAC AAA ATT GAA CCG ATC TTA AAA GCG AGA AAA CAG TAT 16 02 

Ala Leu Lys His Lys lie Glu Pro lie Leu Lys Ala Arg Lys Gin Tyr 
380 385 390 

GCG TAC GGA GCA CAG CAT GAT TAT TTC GAC CAC CAT GAC ATT GTC GGC 16 5 0 

Ala Tyr Gly Ala Gin His Asp Tyr Phe Asp His His Asp lie Val Gly 
395 400 405 410 

TGG ACA AGG GAA GGC GAC AGC TCG GTT GCA AAT TCA GGT TTG GCG GCA 16 98 

Trp Thr Arg Glu Gly Asp Ser Ser Val Ala Asn Ser Gly Leu Ala Ala 
415 420 425 

TTA ATA ACA GAC GGA CCC GGT GGG GCA AAG CGA ATG TAT GTC GGC CGG 174 6 

Leu lie Thr Asp Gly Pro Gly Gly Ala Lys Arg Met Tyr Val Gly Arg 
430 435 440 

CAA AAC GCC GGT GAG ACA TGG CAT GAC ATT ACC GGA AAC CGT TCG GAG 17 94 

Gin Asn Ala Gly Glu Thr Trp His Asp lie Thr Gly Asn Arg Ser Glu 
445 450 455 

CCG GTT GTC ATC AAT TCG GAA GGC TGG GGA GAG TTT CAC GTA AAC GGC 184 2 

Pro Val Val lie Asn Ser Glu Gly Trp Gly Glu Phe His Val Asn Gly 
460 465 470 
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GGG TCG GTT TCA ATT TAT GTT CAA AGA TAGAAGAGCA GAGAGGACGG 18 8 9 

Gly Ser Val Ser lie Tyr Val Gin Arg 
475 480 

ATTTCCTGAA GGAAATCCGT TTTTTTATTT T 192 0 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 512 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Lys Gin Gin Lys Arg Leu Tyr Ala Arg Leu Leu Thr Leu Leu Phe 
-29 -25 -20 -15 

Ala Leu lie Phe Leu Leu Pro His Ser Ala Ala Ala Ala Ala Asn Leu 
-10 -5 1 

Asn Gly Thr Leu Met Gin Tyr Phe Glu Trp Tyr Met Pro Asn Asp Gly 
5 10 15 

Gin His Trp Arg Arg Leu Gin Asn Asp Ser Ala Tyr Leu Ala Glu His 
20 25 30 35 

Gly lie Thr Ala Val Trp lie Pro Pro Ala Tyr Lys Gly Thr Ser Gin 
40 45 50 

Ala Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr Asp Leu Gly Glu Phe 
55 60 65 

His Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys Gly Glu Leu 
70 75 80 

Gin Ser Ala lie Lys Ser Leu His Ser Arg Asp lie Asn Val Tyr Gly 
85 90 95 

Asp Val Val lie Asn His Lys Gly Gly Ala Asp Ala Thr Glu Asp Val 
100 105 110 115 

Thr Ala Val Glu Val Asp Pro Ala Asp Arg Asn Arg Val lie Ser Gly 
120 125 130 

Glu His Leu lie Lys Ala Trp Thr His Phe His Phe Pro Gly Arg Gly 
135 140 145 

Ser Thr Tyr Ser Asp Phe Lys Trp His Trp Tyr His Phe Asp Gly Thr 
150 155 160 

Asp Trp Asp Glu Ser Arg Lys Leu Asn Arg lie Tyr Lys Phe Gin Gly 
165 170 175 

Lys Ala Trp Asp Trp Glu Val Ser Asn Glu Asn Gly Asn Tyr Asp Tyr 
180 185 190 195 

Leu Met Tyr Ala Asp lie Asp Tyr Asp His Pro Asp Val Ala Ala Glu 
200 205 210 

lie Lys Arg Trp Gly Thr Trp Tyr Ala Asn Glu Leu Gin Leu Asp Gly 
215 220 225 



74 



Phe Arg Leu Asp Ala Val Lys His lie Lys Phe Ser Phe Leu Arg Asp 
230 235 240 

Trp Val Asn His Val Arg Glu Lys Thr Gly Lys Glu Met Phe Thr Val 
245 250 255 

Ala Glu Tyr Trp Gin Asn Asp Leu Gly Ala Leu Glu Asn Tyr Leu Asn 
260 265 270 275 

Lys Thr Asn Phe Asn His Ser Val Phe Asp Val Pro Leu His Tyr Gin 
280 285 290 

Phe His Ala Ala Ser Thr Gin Gly Gly Gly Tyr Asp Met Arg Lys Leu 
295 300 305 

Leu Asn Gly Thr Val Val Ser Lys His Pro Leu Lys Ser Val Thr Phe 
310 315 320 

Val Asp Asn His Asp Thr Gin Pro Gly Gin Ser Leu Glu Ser Thr Val 
325 * 330 335 

Gin Thr Trp Phe Lys Pro Leu Ala Tyr Ala Phe lie Leu Thr Arg Glu 
340 345 350 355 

Ser Gly Tyr Pro Gin Val Phe Tyr Gly Asp Met Tyr Gly Thr Lys Gly 
360 365 370 

Asp Ser Gin Arg Glu lie Pro Ala Leu Lys His Lys lie Glu Pro lie 
375 380 385 

Leu Lys Ala Arg Lys Gin Tyr Ala Tyr Gly Ala Gin His Asp Tyr Phe 
390 395 400 

Asp His His Asp lie Val Gly Trp Thr Arg Glu Gly Asp Ser Ser Val 
405 410 415 

Ala Asn Ser Gly Leu Ala Ala Leu lie Thr Asp Gly Pro Gly Gly Ala 
420 425 430 435 

Lys Arg Met Tyr Val Gly Arg Gin Asn Ala Gly Glu Thr Trp His Asp 
440 445 450 

lie Thr Gly Asn Arg Ser Glu Pro Val Val lie Asn Ser Glu Gly Trp 
455 460 465 

Gly Glu Phe His Val Asn Gly Gly Ser Val Ser lie Tyr Val Gin Arg 
470 475 480 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2084 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 250.. 1791 

(ix) FEATURE: 

(A) NAME / KEY : sig_peptide 

(B) LOCATION: 250.. 342 
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(ix) FEATURE: 

(A) NAME / KEY : mat_peptide 
<B) LOCATION: 343.. 1791 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

GCCCCGCACA TACGAAAAGA CTGGCTGAAA ACATTGAGCC TTTGATGACT GATGATTTGG 6 0 

CTGAAGAAGT GGATCGATTG TTTGAGAAAA GAAGAAGACC ATAAAAATAC CTTGTCTGTC 12 0 

AT C AG AC AGG GTATTTTTTA TGCTGTCCAG ACTGTCCGCT GTGTAAAAAT AAGGAATAAA 18 0 

GGGGGGTTGT TATTATTTTA CTGATATGTA AAATATAATT TGTATAAGAA AATGAGAGGG 24 0 

AGAGGAAAC ATG ATT CAA AAA CGA AAG CGG ACA GTT TCG TTC AGA CTT 2 88 

Met lie Gin Lys Arg Lys Arg Thr Val Ser Phe Arg Leu 
-31 -30 -25 -20 

GTG CTT ATG TGC ACG CTG TTA TTT GTC AGT TTG CCG ATT ACA AAA ACA 3 36 

Val Leu Met Cys Thr Leu Leu Phe Val Ser Leu Pro lie Thr Lys Thr 
-15 -10 -5 

TCA GCC GTA AAT GGC ACG CTG ATG CAG TAT TTT GAA TGG TAT ACG CCG 3 84 

Ser Ala Val Asn Gly Thr Leu Met Gin Tyr Phe Glu Trp Tyr Thr Pro 
15 10 

AAC GAC GGC CAG CAT TGG AAA CGA TTG CAG AAT GAT GCG GAA CAT TTA 432 
Asn Asp Gly Gin His Trp Lys Arg Leu Gin Asn Asp Ala Glu His Leu 
15 20 25 30 

TCG GAT ATC GGA ATC ACT GCC GTC TGG ATT CCT CCC GCA TAC AAA GGA 480 
Ser Asp lie Gly lie Thr Ala Val Trp lie Pro Pro Ala Tyr Lys Gly 
35 40 45 

TTG AGC CAA TCC GAT AAC GGA TAC GGA CCT TAT GAT TTG TAT GAT TTA 52 8 

Leu Ser Gin Ser Asp Asn Gly Tyr Gly Pro Tyr Asp Leu Tyr Asp Leu 
50 55 60 

GGA GAA TTC CAG CAA AAA GGG ACG GTC AGA ACG AAA TAC GGC ACA AAA 576 
Gly Glu Phe Gin Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys 
65 70 75 

TCA GAG CTT CAA GAT GCG ATC GGC TCA CTG CAT TCC CGG AAC GTC CAA 62 4 

Ser Glu Leu Gin Asp Ala lie Gly Ser Leu His Ser Arg Asn Val Gin 
80 85 90 

GTA TAC GGA GAT GTG GTT TTG AAT CAT AAG GCT GGT GCT GAT GCA ACA 6 72 

Val Tyr Gly Asp Val Val Leu Asn His Lys Ala Gly Ala Asp Ala Thr 
95 100 105 110 

GAA GAT GTA ACT GCC GTC GAA GTC AAT CCG GCC AAT AGA AAT CAG GAA 72 0 

Glu Asp Val Thr Ala Val Glu Val Asn Pro Ala Asn Arg Asn Gin Glu 
115 120 125 

ACT TCG GAG GAA TAT CAA ATC AAA GCG TGG ACG GAT TTT CGT TTT CCG 76 8 

Thr Ser Glu Glu Tyr Gin lie Lys Ala Trp Thr Asp Phe Arg Phe Pro 
130 135 140 

GGC CGT GGA AAC ACG TAC AGT GAT TTT AAA TGG CAT TGG TAT CAT TTC 816 
Gly Arg Gly Asn Thr Tyr Ser Asp Phe Lys Trp His Trp Tyr His Phe 
145 150 155 

GAC GGA GCG GAC TGG GAT GAA TCC CGG AAG ATC AGC CGC ATC TTT AAG 864 
Asp Gly Ala Asp Trp Asp Glu Ser Arg Lys lie Ser Arg He Phe Lys 
160 165 170 

TTT CGT GGG GAA GGA AAA GCG TGG GAT TGG GAA GTA TCA AGT GAA AAC 912 
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Phe Arg Gly Glu Gly Lys Ala Trp Asp Trp Glu Val Ser Ser Glu Asn 
175 180 185 190 

GGC AAC TAT GAC TAT TTA ATG TAT GCT GAT GTT GAC TAC GAC CAC CCT 96 0 

Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Tyr Asp His Pro 
195 200 205 

GAT GTC GTG GCA GAG ACA AAA AAA TGG GGT ATC TGG TAT GCG AAT GAA 100 8 

Asp Val Val Ala Glu Thr Lys Lys Trp Gly lie Trp Tyr Ala Asn Glu 
210 215 220 

CTG TCA TTA GAC GGC TTC CGT ATT GAT GCC GCC AAA CAT ATT AAA TTT 105 6 

Leu Ser Leu Asp Gly Phe Arg lie Asp Ala Ala Lys His lie Lys Phe 
225 230 235 

TCA TTT CTG CGT GAT TGG GTT CAG GCG GTC AGA CAG GCG ACG GGA AAA 1104 
Ser Phe Leu Arg Asp Trp Val Gin Ala Val Arg Gin Ala Thr Gly Lys 
240 245 250 

GAA ATG TTT ACG GTT GCG GAG TAT TGG CAG AAT AAT GCC GGG AAA CTC 1152 
Glu Met Phe Thr Val Ala Glu Tyr Trp Gin Asn Asn Ala Gly Lys Leu 
255 260 265 270 

GAA AAC TAC TTG AAT AAA ACA AGC TTT AAT CAA TCC GTG TTT GAT GTT 12 0 0 

Glu Asn Tyr Leu Asn Lys Thr Ser Phe Asn Gin Ser Val Phe Asp Val 
275 280 285 

CCG CTT CAT TTC AAT TTA CAG GCG GCT TCC TCA CAA GGA GGC GGA TAT 12 4 8 

Pro Leu His Phe Asn Leu Gin Ala Ala Ser Ser Gin Gly Gly Gly Tyr 
290 295 300 

GAT ATG AGG CGT TTG CTG GAC GGT ACC GTT GTG TCC AGG CAT CCG GAA 12 96 

Asp Met Arg Arg Leu Leu Asp Gly Thr Val Val Ser Arg His Pro Glu 
305 310 315 

AAG GCG GTT ACA TTT GTT GAA AAT CAT GAC ACA CAG CCG GGA CAG TCA 1344 
Lys Ala Val Thr Phe Val Glu Asn His Asp Thr Gin Pro Gly Gin Ser 
320 325 330 

TTG GAA TCG ACA GTC CAA ACT TGG TTT AAA CCG CTT GCA TAC GCC TTT 13 92 

Leu Glu Ser Thr Val Gin Thr Trp Phe Lys Pro Leu Ala Tyr Ala Phe 
335 340 345 350 

ATT TTG ACA AGA GAA TCC GGT TAT CCT CAG GTG TTC TAT GGG GAT ATG 144 0 

lie Leu Thr Arg Glu Ser Gly Tyr Pro Gin Val Phe Tyr Gly Asp Met 
355 360 365 

TAC GGG ACA AAA GGG ACA TCG CCA AAG GAA ATT CCC TCA CTG AAA GAT 14 8 8 

Tyr Gly Thr Lys Gly Thr Ser Pro Lys Glu lie Pro Ser Leu Lys Asp 
370 375 380 

AAT ATA GAG CCG ATT TTA AAA GCG CGT AAG GAG TAC GCA TAC GGG CCC 153 6 

Asn lie Glu Pro lie Leu Lys Ala Arg Lys Glu Tyr Ala Tyr. Gly Pro 
385 390 395 

CAG CAC GAT TAT ATT GAC CAC CCG GAT GTG ATC GGA TGG ACG AGG GAA 15 84 

Gin His Asp Tyr lie Asp His Pro Asp Val lie Gly Trp Thr Arg Glu 
400 405 410 

GGT GAC AGC TCC GCC GCC AAA TCA GGT TTG GCC GCT TTA ATC ACG GAC 163 2 

Gly Asp Ser Ser Ala Ala Lys Ser Gly Leu Ala Ala Leu lie Thr Asp 
415 420 425 430 

GGA CCC GGC GGA TCA AAG CGG ATG TAT GCC GGC CTG AAA AAT GCC GGC 16 8 0 

Gly Pro Gly Gly Ser Lys Arg Met Tyr Ala Gly Leu Lys Asn Ala Gly 
435 440 445 
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GAG ACA TGG TAT GAC ATA ACG GGC AAC CGT TCA GAT ACT GTA AAA ATC 172 8 

Glu Thr Trp Tyr Asp lie Thr Gly Asn Arg Ser Asp Thr Val Lys lie 
450 455 460 

GGA TCT GAC GGC TGG GGA GAG TTT CAT GTA AAC GAT GGG TCC GTC TCC 17 76 

Gly Ser Asp Gly Trp Gly Glu Phe His Val Asn Asp Gly Ser Val Ser 
465 470 475 

ATT TAT GTT CAG AAA TAAGGTAATA AAAAAACACC TCCAAGCTGA GTGCGGGTAT 18 31 
lie Tyr Val Gin Lys 
480 

CAGCTTGGAG GTGCGTTTAT TTTTTCAGCC GTATGACAAG GTCGGCATCA GGTGTGACAA 18 91 

ATACGGTATG CTGGCTGTCA TAGGTGACAA ATCCGGGTTT TGCGCCGTTT GGCTTTTTCA 1951 

CATGTCTGAT TTTTGTATAA TCAACAGGCA CGGAGCCGGA ATCTTTCGCC TTGGAAAAAT 2 011 

AAGCGGCGAT CGTAGCTGCT TCCAATATGG ATTGTTCATC GGGATCGCTG CTTTTAATCA 2 071 

CAACGTGGGA TCC 2 084 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 514 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met lie Gin Lys Arg Lys Arg Thr Val Ser Phe Arg Leu Val Leu Met 

-31 -30 -25 -20 

Cys Thr Leu Leu Phe Val Ser Leu Pro lie Thr Lys Thr Ser Ala Val 

-15 -10 -5 1 

Asn Gly Thr Leu Met Gin Tyr Phe Glu Trp Tyr Thr Pro Asn Asp Gly 
5 10 .15 

Gin His Trp Lys Arg Leu Gin Asn Asp Ala Glu His Leu Ser Asp lie 
2 0 2 5 3 0 

Gly lie Thr Ala Val Trp lie Pro Pro Ala Tyr Lys Gly Leu Ser Gin 
35 40 45 

Ser Asp Asn Gly Tyr Gly Pro Tyr Asp Leu Tyr Asp Leu Gly Glu Phe 
50 55 60 65 

Gin Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys Ser Glu Leu 
70 75 80 

Gin Asp Ala lie Gly Ser Leu His Ser Arg Asn Val Gin Val Tyr Gly 
85 90 95 

Asp Val Val Leu Asn His Lys Ala Gly Ala Asp Ala Thr Glu Asp Val 
100 105 110 

Thr Ala Val Glu Val Asn Pro Ala Asn Arg Asn Gin Glu Thr Ser Glu 
115 120 125 

Glu Tyr Gin lie Lys Ala Trp Thr Asp Phe Arg Phe Pro Gly Arg Gly 
130 135 140 145 
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Asn Thr Tyr Ser Asp Phe Lys Trp His Trp Tyr His Phe Asp Gly Ala 
150 155 160 

Asp Trp Asp Glu Ser Arg Lys lie Ser Arg lie Phe Lys Phe Arg Gly 
) 165 170 175 

Glu Gly Lys Ala Trp Asp Trp Glu Val Ser Ser Glu Asn Gly Asn Tyr 
180 185 190 

Asp Tyr Leu Met Tyr Ala Asp Val Asp Tyr Asp His Pro Asp Val Val 
195 200 205 

Ala Glu Thr Lys Lys Trp Gly lie Trp Tyr Ala Asn Glu Leu Ser Leu 
210 215 220 225 

Asp Gly Phe Arg He Asp Ala Ala Lys His He Lys Phe Ser Phe Leu 
230 235 240 

Arg Asp Trp Val Gin Ala Val Arg Gin Ala Thr Gly Lys Glu Met Phe 
245 250 255 

Thr Val Ala Glu Tyr Trp Gin Asn Asn Ala Gly Lys Leu Glu Asn Tyr 
260 265 270 

Leu Asn Lys Thr Ser Phe Asn Gin Ser Val Phe Asp Val Pro Leu His 
275 280 285 

Phe Asn Leu Gin Ala Ala Ser Ser Gin Gly Gly Gly Tyr Asp Met Arg 
290 295 300 305 

Arg Leu Leu Asp Gly Thr Val Val Ser Arg His Pro Glu Lys Ala Val 
310 315 320 

Thr Phe Val Glu Asn His Asp Thr Gin Pro Gly Gin Ser Leu Glu Ser 
325 330 335 

Thr Val Gin Thr Trp Phe Lys Pro Leu Ala Tyr Ala Phe He Leu Thr 
340 345 350 

Arg Glu Ser Gly Tyr Pro Gin Val Phe Tyr Gly Asp Met Tyr Gly Thr 
355 360 365 

Lys Gly Thr Ser Pro Lys Glu He Pro Ser Leu Lys Asp Asn He Glu 
370 375 380 385 

Pro He Leu Lys Ala Arg Lys Glu Tyr Ala Tyr Gly Pro Gin His Asp 
390 395 400 

Tyr He Asp His Pro Asp Val He Gly Trp Thr Arg Glu Gly Asp Ser 
405 410 415 

Ser Ala Ala Lys Ser Gly Leu Ala Ala Leu He Thr Asp Gly Pro Gly 
420 425 430 

Gly Ser Lys Arg Met Tyr Ala Gly Leu Lys Asn Ala Gly Glu Thr Trp 
435 440 445 

Tyr Asp He Thr Gly Asn Arg Ser Asp Thr Val Lys He Gly Ser Asp 
450 455 460 465 

Gly Trp Gly Glu Phe His Val Asn Asp Gly Ser Val Ser He Tyr Val 
470 475 480 

Gin Lys 
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(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1814 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 156 .. 1802 

(ix) FEATURE: 

(A) NAME / KEY : sig_peptide 

(B) LOCATION: 156.. 2 57 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 258.. 1802 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

AAATTCGATA TTGAAAACGA TTACAAATAA AAATTATAAT AGACGTAAAC GTTCGAGGGT 6 0 

TTGCTCCCTT TTTACTCTTT TTATGCAATC GTTTCCCTTA ATTTTTTGGA AGCCAAACCG 12 0 

TCGAATGTAA CATTTGATTA AGGGGGAAGG GCATT GTG CTA ACG TTT CAC CGC 173 

Val Leu Thr Phe His Arg 
-34 -30 

ATC ATT CGA AAA GGA TGG ATG TTC CTG CTC GCG-TTT TTG CTC ACT GTC 221 
lie lie Arg Lys Gly Trp Met Phe Leu Leu Ala Phe Leu Leu Thr Val 
-25 -20 -15 

TCG CTG TTC TGC CCA ACA GGA CAG CCC GCC AAG GCT GCC GCA CCG TTT 26 9 

Ser Leu Phe Cys Pro Thr Gly Gin Pro Ala Lys Ala Ala Ala Pro Phe 
-10 -5 1 

AAC GGC ACC ATG ATG CAG TAT TTT GAA TGG TAC TTG CCG GAT GAT GGC 317 
Asn Gly Thr Met Met Gin Tyr Phe Glu Trp Tyr Leu Pro Asp Asp Gly 
5 10 15 20 

ACG TTA TGG ACC AAA GTG GCC AAT GAA GCC AAC AAC TTA TCC AGC CTT 36 5 

Thr Leu Trp Thr Lys Val Ala Asn Glu Ala Asn Asn Leu Ser Ser Leu 
25 30 35 

GGC ATC ACC GCT CTT TGG CTG CCG CCC GCT TAC AAA GGA ACA AGC CGC 413 
Gly lie Thr Ala Leu Trp Leu Pro Pro Ala Tyr Lys Gly Thr Ser Arg 
40 45 50 

AGC GAC GTA GGG TAC GGA GTA TAC GAC TTG TAT GAC CTC GGC GAA TTC 461 
Ser Asp Val Gly Tyr Gly Val Tyr Asp Leu Tyr Asp Leu Gly Glu Phe 
55 60 65 

AAT CAA AAA GGG ACC GTC CGC ACA AAA TAC GGA ACA AAA GCT CAA TAT 5 09 

Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys Ala Gin Tyr 
70 75 80 

CTT CAA GCC ATT CAA GCC GCC CAC GCC GCT GGA ATG CAA GTG TAC GCC 5 57 

Leu Gin Ala lie Gin Ala Ala His Ala Ala Gly Met Gin Val Tyr Ala 
85 90 95 100 

GAT GTC GTG TTC GAC CAT AAA GGC GGC GCT GAC GGC ACG GAA TGG GTG 6 05 

Asp Val Val Phe Asp His Lys Gly Gly Ala Asp Gly Thr Glu Trp Val 
105 110 115 
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GAC GCC GTC GAA GTC AAT CCG TCC GAC CGC AAC CAA GAA ATC TCG GGC 6 53 

Asp Ala Val Glu Val Asn Pro Ser Asp Arg Asn Gin Glu lie Ser Gly 
120 125 130 

ACC TAT CAA ATC CAA GCA TGG ACG AAA TTT GAT TTT CCC GGG CGG GGC 7 01 

Thr Tyr Gin lie Gin Ala Trp Thr Lys Phe Asp Phe Pro Gly Arg Gly 
135 140 145 

AAC ACC TAC TCC AGC TTT AAG TGG CGC TGG TAC CAT TTT GAC GGC GTT 74 9 

Asn Thr Tyr Ser Ser Phe Lys Trp Arg Trp Tyr His Phe Asp Gly Val 
150 155 160 

GAT TGG GAC GAA AGC CGA AAA TTG AGC CGC ATT TAC AAA TTC CGC GGC 797 
Asp Trp Asp Glu Ser Arg Lys Leu Ser Arg lie Tyr Lys Phe Arg Gly 
165 170 175 180 

ATC GGC AAA GCG TGG GAT TGG GAA GTA GAC ACG GAA AAC GGA AAC TAT 84 5 

lie Gly Lys Ala Trp Asp Trp Glu Val Asp Thr Glu Asn Gly Asn Tyr 
185 190 195 

GAC TAC TTA ATG TAT GCC GAC CTT GAT ATG GAT CAT CCC GAA GTC GTG 8 93 

Asp Tyr Leu Met Tyr Ala Asp Leu Asp Met Asp His Pro Glu Val Val 
200 205 210 

ACC GAG CTG AAA AAC TGG GGG AAA TGG TAT GTC AAC ACA ACG AAC ATT 941 
Thr Glu Leu Lys Asn Trp Gly Lys Trp Tyr Val Asn Thr Thr Asn lie 
215 220 225 

GAT GGG TTC CGG CTT GAT GCC GTC AAG CAT ATT AAG TTC AGT TTT TTT 98 9 

Asp Gly Phe Arg Leu Asp Ala Val Lys His lie Lys Phe Ser Phe Phe 
230 235 240 

CCT GAT TGG TTG TCG TAT GTG CGT TCT CAG ACT GGC AAG CCG CTA TTT 103 7 

Pro Asp Trp Leu Ser Tyr Val Arg Ser Gin Thr Gly Lys Pro Leu Phe 
245 250 255 260 

ACC GTC GGG GAA TAT TGG AGC TAT GAC ATC AAC AAG TTG CAC AAT TAC 108 5 

Thr Val Gly Glu Tyr Trp Ser Tyr Asp lie Asn Lys Leu His Asn Tyr 
265 270 275 

ATT ACG AAA ACA GAC GGA ACG ATG TCT TTG TTT GAT GCC CCG TTA CAC 113 3 

lie Thr Lys Thr Asp Gly Thr Met Ser Leu Phe Asp Ala Pro Leu His 
280 285 290 

AAC AAA TTT TAT ACC GCT TCC AAA TCA GGG GGC GCA TTT GAT ATG CGC 1181 
Asn Lys Phe Tyr Thr Ala Ser Lys Ser Gly Gly Ala Phe Asp Met Arg 
295 300 305 

ACG TTA ATG ACC AAT ACT CTC ATG AAA GAT CAA CCG ACA TTG GCC GTC 122 9 

Thr Leu Met Thr Asn Thr Leu Met Lys Asp Gin Pro Thr Leu Ala Val 
310 315 320 

ACC TTC GTT GAT AAT CAT GAC ACC GAA CCC GGC CAA GCG CTG CAG TCA 12 7 7 

Thr Phe Val Asp Asn His Asp Thr Glu Pro Gly Gin Ala Leu Gin Ser 
325 330 335 340 

TGG GTC GAC CCA TGG TTC AAA CCG TTG GCT TAC GCC TTT ATT CTA ACT 132 5 

Trp Val Asp Pro Trp Phe Lys Pro Leu Ala Tyr Ala Phe lie Leu Thr 
345 350 355 

CGG CAG GAA GGA TAC CCG TGC GTC TTT TAT GGT GAC TAT TAT GGC ATT 13 7 3 

Arg Gin Glu Gly Tyr Pro Cys Val Phe Tyr Gly Asp Tyr Tyr Gly lie 
360 365 370 
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CCA CAA TAT AAC ATT CCT TCG CTG AAA AGC AAA ATC GAT CCG CTC CTC 14 21 

Pro Gin Tyr Asn lie Pro Ser Leu Lys Ser Lys lie Asp Pro Leu Leu 
375 380 385 

ATC GCG CGC AGG GAT TAT GCT TAC GGA ACG CAA CAT GAT TAT CTT GAT 14 6 9 

lie Ala Arg Arg Asp Tyr Ala Tyr Gly Thr Gin His Asp Tyr Leu Asp 
390 395 400 

CAC TCC GAC ATC ATC GGG TGG ACA AGG GAA GGG GGC ACT GAA AAA CCA 1517 
His Ser Asp lie lie Gly Trp Thr Arg Glu Gly Gly Thr Glu Lys Pro 
405 410 415 420 

GGA TCC GGA CTG GCC GCA CTG ATC ACC GAT GGG CCG GGA GGA AGC AAA 156 5 

Gly Ser Gly Leu Ala Ala Leu lie Thr Asp Gly Pro Gly Gly Ser Lys 
425 430 435 

TGG ATG TAC GTT GGC AAA CAA CAC GCT GGA AAA GTG TTC TAT GAC CTT 1613 
Trp Met Tyr Val Gly Lys Gin His Ala Gly Lys Val Phe Tyr Asp Leu 
440 445 450 . 

ACC GGC AAC CGG AGT GAC ACC GTC ACC ATC AAC AGT GAT GGA TGG GGG 16 61 

Thr Gly Asn Arg Ser Asp Thr Val Thr lie Asn Ser Asp Gly Trp Gly 
455 460 465 

GAA TTC AAA GTC AAT GGC GGT TCG GTT TCG GTT TGG GTT CCT AGA AAA 1709 
Glu Phe Lys Val Asn Gly Gly Ser Val Ser Val Trp Val Pro Arg Lys 
470 475 480 

ACG ACC GTT TCT ACC ATC GCT CGG CCG ATC ACA ACC CGA CCG TGG ACT 17 5 7 

Thr Thr Val Ser Thr lie Ala Arg Pro lie Thr Thr Arg Pro Trp Thr 
485 490 495 500 

GGT GAA TTC GTC CGT TGG ACC GAA CCA CGG TTG GTG GCA TGG CCT 18 0 2 

Gly Glu Phe Val Arg Trp Thr Glu Pro Arg Leu Val Ala Trp Pro 
505 510 515 

TGATGCCTGC GA 1814 

(2) INFORMATION FOR SEQ ID NO : 6 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Val Leu Thr Phe His Arg lie lie Arg Lys Gly Trp Met Phe Leu Leu 
-34 -30 -25 -20 

Ala Phe Leu Leu Thr Val Ser Leu Phe Cys Pro Thr Gly Gin Pro Ala 
-15 -10 -5 

Lys Ala Ala Ala Pro Phe Asn Gly Thr Met Met Gin Tyr Phe Glu Trp 
15 10 

Tyr Leu Pro Asp Asp Gly Thr Leu Trp Thr Lys Val Ala Asn Glu Ala 
15 20 25 30 

Asn Asn Leu Ser Ser Leu Gly lie Thr Ala Leu Trp Leu Pro Pro Ala 
35 40 45 

Tyr Lys Gly Thr Ser Arg Ser Asp Val Gly Tyr Gly Val Tyr Asp Leu 
50 55 60 
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Tyr Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr 
65 70 75 

Gly Thr Lys Ala Gin Tyr Leu Gin Ala lie Gin Ala Ala His Ala Ala 
80 85 90 

Gly Met Gin Val Tyr Ala Asp Val Val Phe Asp His Lys Gly Gly Ala 
95 100 105 110 

Asp Gly Thr Glu Trp Val Asp Ala Val Glu Val Asn Pro Ser Asp Arg 
115 120 125 

Asn Gin Glu lie Ser Gly Thr Tyr Gin He Gin Ala Trp Thr Lys Phe 
130 135 140 

Asp Phe Pro Gly Arg Gly Asn Thr Tyr Ser Ser Phe Lys Trp Arg Trp 
145 150 155 

Tyr His Phe Asp Gly Val Asp Trp Asp Glu Ser Arg Lys Leu Ser Arg 
160 165 170 

He Tyr Lys Phe Arg Gly lie Gly Lys Ala Trp Asp Trp Glu Val Asp 
175 180 185 190 

Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Leu Asp Met 
195 200 , 205 

Asp His Pro Glu Val Val Thr Glu Leu Lys Asn Trp Gly Lys Trp Tyr 
210 215 220 

Val Asn Thr Thr Asn lie Asp Gly Phe Arg Leu Asp Ala Val Lys His 
225 230 235 

He Lys Phe Ser Phe Phe Pro Asp Trp Leu Ser Tyr Val Arg Ser Gin 
240 245 250 

Thr Gly Lys Pro Leu Phe Thr Val Gly Glu Tyr Trp Ser Tyr Asp He 
255 260 265 270 

Asn Lys Leu His Asn Tyr He Thr Lys Thr Asp Gly Thr Met Ser Leu 
275 280 285 

Phe Asp Ala Pro Leu His Asn Lys Phe Tyr Thr Ala Ser Lys Ser Gly 
290 295 300 

Gly Ala Phe Asp Met Arg Thr Leu Met Thr Asn Thr Leu Met Lys Asp 
305 310 315 

Gin Pro Thr Leu Ala Val Thr Phe Val Asp Asn His Asp Thr Glu Pro 
320 325 330 

Gly Gin Ala Leu Gin Ser Trp Val Asp Pro Trp Phe Lys Pro Leu Ala 
335 340 345 350 

Tyr Ala Phe He Leu Thr Arg Gin Glu Gly Tyr Pro Cys Val Phe Tyr 
355 360 365 

Gly Asp Tyr Tyr Gly He Pro Gin Tyr Asn He Pro Ser Leu Lys Ser 
370 375 380 

Lys He Asp Pro Leu Leu He Ala Arg Arg Asp Tyr Ala Tyr Gly Thr 
385 390 395 

Gin His Asp Tyr Leu Asp His Ser Asp He He Gly Trp Thr Arg Glu 
400 405 410 
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Gly Gly Thr Glu Lys Pro Gly Ser Gly Leu Ala Ala Leu lie Thr Asp 
415 420 425 430 

Gly Pro Gly Gly Ser Lys Trp Met Tyr Val Gly Lys Gin His Ala Gly 
435 440 445 

Lys Val Phe Tyr Asp Leu Thr Gly Asn Arg Ser Asp Thr Val Thr lie 
450 455 460 

Asn Ser Asp Gly Trp Gly Glu Phe Lys Val Asn Gly Gly Ser Val Ser 
465 470 475 

Val Trp Val Pro Arg Lys Thr Thr Val Ser Thr lie Ala Arg Pro lie 
480 485 490 

Thr Thr Arg Pro Trp Thr Gly Glu Phe Val Arg Trp Thr Glu Pro Arg 
495 500 505 510 

Leu Val Ala Trp Pro - 
515 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
GGTCGTAGGC ACCGTAGCCC CAATCCGCTT G 31 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
GGTCGTAGGC ACCGTAGCCC CAATCCCATT GGCTCG 3 6 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
CTGTGACTGG TGAGTACTCA ACCAAGTC 2 8 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 78 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Ala Thr Pro Ala Asp Trp Arg Ser Gin Ser lie Tyr Phe Leu Leu Thr 
15 10 15 

Asp Arg Phe Ala Arg Thr Asp Gly Ser Thr Thr Ala Thr Cys Asn Thr 
20 25 30 

Ala Asp Gin Lys Tyr Cys Gly Gly Thr Trp Gin Gly He He Asp Lys 
35 40 45 

Leu Asp Tyr He Gin Gly Met Gly Phe Thr Ala He Trp He Thr Pro 
50 55 60 

Val Thr Ala Gin Leu Pro Gin Thr Thr Ala Tyr Gly Asp Ala Tyr His 
65 70 75 80 

Gly Tyr Trp Gin Gin Asp He Tyr Ser Leu Asn Glu Asn Tyr Gly Thr 
85 90 95 

Ala Asp Asp Leu Lys Ala Leu Ser Ser Ala Leu His Glu Arg Gly Met 
100 105 110 

Tyr Leu Met Val Asp Val Val Ala Asn His Met Gly Tyr Asp Gly Ala 
115 120 125 

Gly Ser Ser Val Asp Tyr Ser Val Phe Lys Pro Phe Ser Ser Gin Asp 
130 135 140 

Tyr Phe His Pro Phe Cys Phe He Gin Asn Tyr Glu Asp Gin Thr Gin 
145 150 155 160 

Val Glu Asp Cys Trp Leu Gly Asp Asn Thr Val Ser Leu Pro Asp Leu 
165 170 175 

Asp Thr Thr Lys Asp Val Val Lys Asn Glu Trp Tyr Asp Trp Val Gly 
180 185 190 

Ser Leu Val Ser Asn Tyr Ser He Asp Gly Leu Arg He Asp Thr Val 
195 200 205 

Lys His Val Gin Lys Asp Phe Trp Pro Gly Tyr Asn Lys Ala Ala Gly 
210 215 220 

Val Tyr Cys He Gly Glu Val Leu Asp Gly Asp Pro Ala Tyr Thr Cys 
225 230 235 240 

Pro Tyr Gin Asn Val Met Asp Gly Val Leu Asn Tyr Pro He Tyr Tyr 
245 250 255 

Pro Leu Leu Asn Ala Phe Lys Ser Thr Ser Gly Ser Met Asp Asp Leu 
260 265 270 

Tyr Asn Met He Asn Thr Val Lys Ser Asp Cys Pro Asp Ser Thr Leu 
275 280 285 
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Leu Gly Thr Phe Val Glu Asn His Asp Asn Pro Arg Phe Ala Ser Tyr 
290 295 300 

Thr Asn Asp lie Ala Leu Ala Lys Asn Val Ala Ala Phe lie lie Leu 
305 310 315 320 

Asn Asp Gly lie Pro lie lie Tyr Ala Gly Gin Glu Gin His Tyr Ala 
325 330 335 

Gly Gly Asn Asp Pro Ala Asn Arg Glu Ala Thr Trp Leu Ser Gly Tyr 
340 345 350 

Pro Thr Asp Ser Glu Leu Tyr Lys Leu lie Ala Ser Ala Asn Ala lie 
355 360 365 

Arg Asn Tyr Ala lie Ser Lys Asp Thr Gly Phe Val Thr Tyr Lys Asn 
370 375 380 

Trp Pro lie Tyr Lys Asp Asp lie Thr lie Ala Met Arg Lys Gly Thr 
385 390 395 400 

Asp Gly Ser Gin lie Val Thr lie Leu Ser Asn Lys Gly Ala Ser Gly 
405 410 415 

Asp Ser Tyr Thr Leu Ser Leu Ser Gly Ala Gly Tyr Thr Ala Gly Gin 
420 425 430 

Gin Leu Thr Glu Val lie Gly Cys Thr Thr Val Thr Val Gly Ser Asp 
435 440 445 

Gly Asn Val Pro Val Pro Met Ala Gly Gly Leu Pro Arg Val Leu Tyr 
450 455 460 

Pro Thr Glu Lys Leu Ala Gly Ser Lys lie Cys Ser Ser Ser 
465 470 475 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1458 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 1..1455 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 

CAT CAT AAT GGA ACA AAT GGT ACT ATG ATG CAA TAT TTC GAA TGG TAT 4 8 

His His Asn Gly Thr Asn Gly Thr Met Met Gin Tyr Phe Glu Trp Tyr 
520 525 530 

TTG CCA AAT GAC GGG AAT CAT TGG AAC AGG TTG AGG GAT GAC GCA GCT 96 
Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Asp Asp Ala Ala 
535 540 545 

AAC TTA AAG AGT AAA GGG ATA ACA GCT GTA TGG ATC CCA CCT GCA TGG 14 4 

Asn Leu Lys Ser Lys Gly lie Thr Ala Val Trp lie Pro Pro Ala Trp 
550 555 560 

AAG GGG ACT TCC CAG AAT GAT GTA GGT TAT GGA GCC TAT GAT TTA TAT 192 
Lys Gly Thr Ser Gin Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr 
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565 570 575 

GAT CTT GGA GAG TTT AAC CAG AAG GGG ACG GTT CGT ACA AAA TAT GGA 24 0 

Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly 
580 585 590 595 

ACA CGC AAC CAG CTA CAG GCT GCG GTG ACC TCT TTA AAA AAT AAC GGC 2 88 

Thr Arg Asn Gin Leu Gin Ala Ala Val Thr Ser Leu Lys Asn Asn Gly 
600 605 610 

ATT CAG GTA TAT GGT GAT GTC GTC ATG AAT CAT AAA GGT GGA GCA GAT 3 36 

lie Gin Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp 
615 620 625 

GGT ACG GAA ATT GTA AAT GCG GTA GAA GTG AAT CGG AGC AAC CGA AAC 3 84 

Gly Thr Glu lie Val Asn Ala Val Glu Val Asn Arg Ser Asn Arg Asn 
630 635 640 

CAG GAA ACC TCA GGA GAG TAT GCA ATA GAA GCG TGG ACA AAG TTT GAT 4 32 

Gin Glu Thr Ser Gly Glu Tyr Ala lie Glu Ala Trp Thr Lys Phe Asp 
645 650 655 

TTT CCT GGA AGA GGA AAT AAC CAT TCC AGC TTT AAG TGG CGC TGG TAT 48 0 

Phe Pro Gly Arg Gly Asn Asn His Ser Ser Phe Lys Trp Arg Trp Tyr 
660 665 670 675 

CAT TTT GAT GGG ACA GAT TGG GAT CAG TCA CGC CAG CTT CAA AAC AAA 52 8 

His Phe Asp Gly Thr Asp Trp Asp Gin Ser Arg Gin Leu Gin Asn Lys 
680 685 690 

ATA TAT AAA TTC AGG GGA ACA GGC AAG GCC TGG GAC TGG GAA GTC GAT 57 6 

lie Tyr Lys Phe Arg Gly Thr Gly Lys Ala Trp Asp Trp Glu Val Asp 
695 700 705 

ACA GAG AAT GGC AAC TAT GAC TAT CTT ATG TAT GCA GAC GTG GAT ATG 62 4 

Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Met 
710 715 720 

GAT CAC CCA GAA GTA ATA CAT GAA CTT AGA AAC TGG GGA GTG TGG TAT 6 72 

Asp His Pro Glu Val lie His Glu Leu Arg Asn Trp Gly Val Trp Tyr 
725 730 735 

ACG AAT ACA CTG AAC CTT GAT GGA TTT AGA ATA GAT GCA GTG AAA CAT 72 0 

Thr Asn Thr Leu Asn Leu Asp Gly Phe Arg lie Asp Ala Val Lys His 
740 745 750 755 

ATA AAA TAT AGC TTT ACG AGA GAT TGG CTT ACA CAT GTG CGT AAC ACC 76 8 

lie Lys Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg Asn Thr 
760 765 770 

ACA GGT AAA CCA ATG TTT GCA GTG GCT GAG TTT TGG AAA AAT GAC CTT 816 
Thr Gly Lys Pro Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu 
775 780 785 

GGT GCA ATT GAA AAC TAT TTG AAT AAA ACA AGT TGG AAT CAC TCG GTG 864 
Gly Ala lie Glu Asn Tyr Leu Asn Lys Thr Ser Trp Asn His Ser Val 
790 795 800 

TTT GAT GTT CCT CTC CAC TAT AAT TTG TAC AAT GCA TCT AAT AGC GGT 912 
Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Asn Ser Gly 
805 810 815 

GGT TAT TAT GAT ATG AGA AAT ATT TTA AAT GGT TCT GTG GTG CAA AAA 960 
Gly Tyr Tyr Asp Met Arg Asn lie Leu Asn Gly Ser Val Val Gin Lys 
820 825 830 835 

CAT CCA ACA CAT GCC GTT ACT TTT GTT GAT AAC CAT GAT TCT CAG CCC 1008 
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His Pro Thr His Ala Val Thr Phe Val Asp Asn His Asp Ser Gin Pro, 
840 845 850 

GGG GAA GCA TTG GAA TCC TTT GTT CAA CAA TGG TTT AAA CCA CTT GCA 10 5 6 

Gly Glu Ala Leu Glu Ser Phe Val Gin Gin Trp Phe Lys Pro Leu Ala 
855 860 865 

TAT GCA TTG GTT CTG ACA AGG GAA CAA GGT TAT CCT TCC GTA TTT TAT 1104 
Tyr Ala Leu Val Leu Thr Arg Glu Gin Gly Tyr Pro Ser Val Phe Tyr 
870 875 880 

GGG GAT TAC TAC GGT ATC CCA ACC CAT GGT GTT CCG GCT ATG AAA TCT 1152 
Gly Asp Tyr Tyr Gly lie Pro Thr His Gly Val Pro Ala Met Lys Ser 
885 890 895 

AAA ATA GAC CCT CTT CTG CAG GCA CGT CAA ACT TTT GCC TAT GGT ACG 12 0 0 

Lys lie Asp Pro Leu Leu Gin Ala Arg Gin Thr Phe Ala Tyr Gly Thr 
900 905 910 915 

CAG CAT GAT TAC TTT GAT CAT CAT GAT ATT ATC GGT TGG ACA AGA GAG 12 4 8 

Gin His Asp Tyr Phe Asp His His Asp lie lie Gly Trp Thr Arg Glu 
920 925 930 

GGA AAT AGC TCC CAT CCA AAT TCA GGC CTT GCC ACC ATT ATG TCA GAT 12 96 

Gly Asn Ser Ser His Pro Asn Ser Gly Leu Ala Thr lie Met Ser Asp 
935 940 945 

GGT CCA GGT GGT AAC AAA TGG ATG TAT GTG GGG AAA AAT AAA GCG GGA 13 44 

Gly Pro Gly Gly Asn Lys Trp Met Tyr Val Gly Lys Asn Lys Ala Gly 
950 955 960 

CAA GTT TGG AGA GAT ATT ACC GGA AAT AGG ACA GGC ACC GTC ACA ATT 13 92 

Gin Val Trp Arg Asp lie Thr Gly Asn Arg Thr Gly Thr Val Thr lie 
965 970 975 

AAT GCA GAC GGA TGG GGT AAT TTC TCT GTT AAT GGA GGG TCC GTT TCG 14 4 0 

Asn Ala Asp Gly Trp Gly Asn Phe Ser Val Asn Gly Gly Ser Val Ser 
980 985 990 995 

GTT TGG GTG AAG CAA TAA 14 58 

Val Trp Val Lys Gin 
1000 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 485 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

His His Asn Gly Thr Asn Gly Thr Met Met Gin Tyr Phe Glu Trp Tyr 
15 10 15 

Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Asp Asp Ala Ala 
20 25 30 

Asn Leu Lys Ser Lys Gly lie Thr Ala Val Trp lie Pro Pro Ala Trp 
35 40 45 

Lys Gly Thr Ser Gin Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr 
50 55 60 



88 



Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly 
65 70 75 80 

Thr Arg Asn Gin Leu Gin Ala Ala Val Thr Ser Leu Lys Asn Asn Gly 
85 90 95 

lie Gin Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp 
100 105 110 

Gly Thr Glu lie Val Asn Ala Val Glu Val Asn Arg Ser Asn Arg Asn 
115 120 125 

Gin Glu Thr Ser Gly Glu Tyr Ala lie Glu Ala Trp Thr Lys Phe Asp 
130 135 140 

Phe Pro Gly Arg Gly Asn Asn His Ser Ser Phe Lys Trp Arg Trp Tyr 
145 150 155 160 

His Phe Asp Gly Thr Asp Trp Asp Gin Ser Arg Gin Leu Gin Asn Lys 
165 170 175 

lie Tyr Lys .Phe Arg Gly Thr Gly Lys Ala Trp Asp Trp Glu Val Asp 
180 185 190 

Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Met 
195 200 - 205 

Asp His Pro Glu Val lie His Glu Leu Arg Asn Trp Gly Val Trp Tyr 
210 215 220 

Thr Asn Thr Leu Asn Leu Asp Gly Phe Arg lie Asp Ala Val Lys His 
225 230 235 240 

lie Lys Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg Asn Thr 
245 250 255 

Thr Gly Lys Pro Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu 
260 265 270 

Gly Ala lie Glu Asn Tyr Leu Asn Lys Thr Ser Trp Asn His Ser Val 
275 280 285 

Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Asn Ser Gly 
290 295 300 

Gly Tyr Tyr Asp Met Arg Asn lie Leu Asn Gly Ser Val Val Gin Lys 
305 310 315 320 

His Pro Thr His Ala Val Thr Phe Val Asp Asn His Asp Ser Gin Pro 
325 330 335 

Gly Glu Ala Leu Glu Ser Phe Val Gin Gin Trp Phe Lys Pro Leu Ala 
340 345 350 

Tyr Ala Leu Val Leu Thr Arg Glu Gin Gly Tyr Pro Ser Val Phe Tyr 
355 360 365 

Gly Asp Tyr Tyr Gly lie Pro Thr His Gly Val Pro Ala Met Lys Ser 
370 375 380 

Lys lie Asp Pro Leu Leu Gin Ala Arg Gin Thr Phe Ala Tyr Gly Thr 
385 390 395 400 

Gin His Asp Tyr Phe Asp' His His Asp lie lie Gly Trp Thr Arg Glu 
405 410 415 
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Gly Asn Ser Ser His 
420 

Gly Pro Gly Gly Asn 
435 

Gin Val Trp Arg Asp 
450 

Asn Ala Asp Gly Trp 
465 



Pro Asn Ser Gly Leu Ala 
425 

Lys Trp Met Tyr Val Gly 
440 

lie Thr Gly Asn Arg Thr 
455 

Gly Asn Phe Ser Val Asn 
470 475 



Thr lie Met Ser Asp 
430 

Lys Asn Lys Ala Gly 
445 

Gly Thr Val Thr lie 
460 

Gly Gly Ser Val Ser 
480 



Val Trp Val Lys Gin 
485 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 13 : 



Val 


Asn 


Gly 


Thr 


Leu 


Met 


Gin 


Tyr 


Phe 


Glu 


Trp 


Tyr 


Thr 


Pro 


Asn 


Asp 


l 








5 










10 










15 




Gly 


Gin 


His 


Trp 


Lys 


Arg 


Leu 


Gin 


Asn 


Asp 


Ala 


Glu 


His 


Leu 


Ser 


Asp 








2 0 










25 










30 






lie 


Gly 


lie 


Thr 


Ala 


Val 


Trp 


lie 


Pro 


Pro 


Ala 


Tyr 


Lys 


Gly 


Leu 


Ser 






35 










40 










45 








Gin 


Ser 


Asp 


Asn 


Gly 


Tyr 


Gly 


Pro 


Tyr 


Asp 


Leu 


Tyr 


Asp 


Leu 


Gly 


Glu 




50 










55 










60 










Phe 


Gin 


Gin 


Lys 


Gly 


Thr 


Val 


Arg 


Thr 


Lys 


Tyr 


Gly 


Thr 


Lys 


Ser 


Glu 


65 










70 










75 










80 


Leu 


Gin 


Asp 


Ala 


lie 


Gly 


Ser 


Leu 


His 


Ser 


Arg 


Asn 


Val 


Gin 


Val 


Tyr 










85 










90 










95 




Gly 


Asp 


Val 


Val 


Leu 


Asn 


His 


Lys 


Ala 


Gly 


Ala 


Asp 


Ala 


Thr 


Glu 


Asp 








100 










105 










110 






Val 


Thr 


Ala 


Val 


Glu 


Val 


Asn 


Pro 


Ala 


Asn 


Arg 


Asn 


Gin 


Glu 


Thr 


Ser 






115 










120 








125 








Glu 


Glu 


Tyr 


Gin 


He 


Lys 


Ala 


Trp 


Thr 


Asp 


Phe 


Arg 


Phe 


Pro 


Gly 


Arg 




130 










135 










140 










Gly 


Asn 


Thr 


Tyr 


Ser 


Asp 


Phe 


Lys 


Trp 


His 


Trp 


Tyr 


His 


Phe 


Asp 


Gly 


145 










150 










155 










160 


Ala 


Asp 


Trp 


Asp 


Glu 


Ser 


Arg 


Lys 


He 


Ser 


Arg 


He 


Phe 


Lys 


Phe 


Arg 










165 










170 










175 




Gly 


Glu 


Gly 


Lys 


Ala 


Trp 


Asp 


Trp 


Glu 


Val 


Ser 


Ser 


Glu 


Asn 


Gly 


Asn 








180 










185 










190 






Tyr 


Asp 


Tyr 


Leu 


Met 


Tyr 


Ala 


Asp 


Val 


Asp 


Tyr 


Asp 


His 


Pro 


Asp 


Val 






195 










200 










205 








Val 


Ala 


Glu 


Thr 


Lys 


Lys 


Trp 


Gly 


He 


Trp 


Tyr 


Ala 


Asn 


Glu 


Leu 
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